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Abstract 

High-gain nonlinear observers occur in the nonlinear automatic control 
theory and are in standard usage in chemical engineering processes. We 
apply such a type of analysis in the context of a very simple one-gene reg- 
ulation circuit. In general, an observer combines an analytical differential- 
equation-based model with partial measurement of the system in order to 
estimate the non-measured state variables. We use one of the simplest 
observers, that of Gauthier et al., which is a copy of the original system 
plus a correction term which is easy to calculate. For the illustration 
of this procedure, we employ a biological model, recently adapted from 
Goodwin's old book by De Jong, in which one plays with the dynamics 
of the concentrations of the messenger RNA coding for a given protein, 
the protein itself, and a single metabolite. Using the observer instead of 
the metabolite, it is possible to rebuild the non-measured concentrations 
of the mRNA and the protein. 
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1 Introduction 



According to textbooks, gene expression is a very complicated dynamical pro- 
cess which is regulated at a number of its stages during the synthesis of proteins 
PQ. Similar to many big cities, with heavy traffic, biological cells host compli- 
cated traffic of biochemical signals at all levels. At the nanometer scale, clusters 
of molecules in the form of proteins drive the dynamics of the cellular network 
that schematically can be divided into four regulated parts: the DNA or genes, 
the transcribed RNAs, the set of interacting proteins, and the metabolites. 
Genes can only affect other genes through specific proteins, as well as through 
some metabolic pathways that are regulated by proteins themselves. They act 
to catalyze the information stored in DNA, all the way from the fundamental 
processes of transcription and translation to the final quantities of produced 
proteins. 

For the purpose of modeling, it is essential to generate simple models that 
help to understand elementary dynamical components of these complex reg- 
ulatory networks as molecular tools that participate in an important way in 
the machinery of cellular decisions, that is to say, in the behavior and genetic 
program of cells. The central importance of control theory in Biology can be 
assessed through the recent problem of identifying control motifs (or modules), 
which are patterns that occur in a gene network far more often than in random- 
ized networks of biological regulators [5] . This hot issue has been first pinpointed 
in a breakthrough paper of Doyle and collaborators [3] in which the regulation 
of bacterial chemotaxis was interpreted in terms of the simple integral control 
'adaptive module' introduced by Barkai and Leibler 0j. Since gene regulation 
appears to occur only at some definite states of the whole process, which in 
general are not well known, we are from the point of view of control engineering 
in the case of the reconstruction of those specific states under the condition of 
limited information. 

It is quite clear that the availability of all state variables to direct measure- 
ment is an extremely rare occasion for gene expression phenomena or when it is 
possible it could be too expensive. For this particular task, but in completely dif- 
ferent technological areas, the engineers have developed software sensors (state 
observers) that accurately reconstruct the state variables of various technologi- 
cal processes [S]. The basic concept of state of a system or process could have 
many different empirical meanings in biology. For the particular case of gene 
expression, the meaning of a state is essentially that of a concentration. The 
typical problem in control engineering that appears to be tremendously useful in 
biology is the reconstruction of some specific regulated states under conditions 
of limited information. 

In general, an observer is expected to provide a good estimate X(t) of the 
natural state X(t) of the original system. For this, one usually can think that 
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some distance d 



X(t),X(t)) (in the sense of 



a norm || ■ || in a vectorial space) 



goes to zero as t — ► oo. Such softwares can be constructed using the mathe- 
matical model of the process to obtain an estimate X of the true state X. This 
estimate can then be used as a substitute for the unknown state X. The usage 
of state observers has proven useful in process monitoring and for many other 
tasks. The concept of observer is used herein in the sense of control theory, defin- 
ing an algorithm capable of giving a reasonable estimation of the unmeasured 
variables of a process. In the case of gene expression processes the description 
is made very concrete in the following by looking at quite simple mathematical 
models that refer to single gene cases and which in principle can be extended 
to some operons that are single gene clusters. 

In this paper we will examine in detail a particularly simple observer due 
to Gauthier and collaborators [6] possessing arbitrary exponential decay and 
linear error dynamics for the case of a three-state genetic regulation process. 
We were led to consider this observer because of its simplicity and its high gain 
property. The gain is defined as the amount of increase in error in the dynamics 
of the observer. This amount is directly related to the velocity with which the 
observer recovers the unknown signal. For the observer of Gauthier et al the 
amount of increase in error is constant and usually of high values leading to a 
fast recover of the unmeasurable states. 



2 Mathematical Model for a Simple Gene Reg- 
ulation Process 



A kinetic model of a simple genetic regulation process was first developed by 
Goodwin as long ago as 1963 7\. It has been further generalized by Tyson and 
Othmer [8] and clearly explained by De Jong in his recent review [5] . We consider 
here the most simple version of this kinetic model. For three concentrations 
X\, X2, X3, corresponding to the messenger RNA (mRNA) that codes for the 
unstable enzyme, the enzyme, and the metabolite, respectively, we write Tyson's 
model in the form 



The parameters K\, K2, K3 are all strictly positive and represent production 
constants, whereas 71,72,73 are also strictly positive degradation constants. 
These rate equations express a balance between the number of molecules ap- 
pearing and disappearing per unit time. Notice that the model assumes that the 
concentration X2 increases linearly with X\ and the concentration X3 linearly 
with X2, which are natural assumptions. In the case of X%, the first term is 
the production term involving a nonlinear nondissipative regulation function H 




(1) 
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that we take of the m-steepen Hill form (to > is the steepness parameter) in 
common use 



H+ (X, 0) = 2 



V"71 

H-(X,&) = 1- 3 



(2) 



Xg 1 + & 



for the activation and inhibition cases, respectively. The parameter 9 gives the 
threshold for the regulatory influence of the concentration of the metabolite on 
the target gene, whereas the steepness parameter to is a measure of the col- 
lective effect of groups of metabolite molecules and also defines the shape of 
the Hill curve. This nonlinear parametrization describes the 'biological regula- 
tion process' that includes the production of the mRNA by transcription of its 
structural gene, its possible intranuclear processing by cleavage, its enzymatic 
degradation within the nucleus, and its migration to the cytoplasm by some 
form of diffusion or biological transport. Once in the cytoplasm, the mRNA is 
both translated into the unstable enzyme and enzymatically degraded. 

System Tbioiogy and its trivial chain generalization in the linear part is con- 
sidered to be a good model for the simplest type of allosteric regulation in 
biochemistry, i.e., the inhibition or activation of an enzyme or protein by a 
small regulatory molecule that interacts with the enzyme at a site (allosteric 
site) other than the active site at which catalytic activity occurs. The inter- 
action changes the shape of the enzyme, thus affecting the active site of the 
standard catalysis. This change of shape of the enzyme is sufficient to change 
its ability to catalyze a reaction in either negative or positive way and enables 
a cell to regulate needed metabolites. The allosteric regulation has the typical 
features of a feedback loop in control theory if the regulatory protein acts on 
the enzyme in the pathway of its own synthesis. 



3 The Nonlinear Observer 

Many attempts have been made to develop nonlinear observer design methods. 
One could mention the industrially popular extended Kalman filter, whose de- 
sign is based on a local linearization of the system around a reference trajectory, 
restricting the validity of the approach within a small region in the state space 
[5] , [10] . The first systematic approach for the development of a theory of nonlin- 
ear observers was proposed some time ago by Krener and fsidori [llj . In further 
works, nonlinear transformations of the coordinates have also been employed to 
put the considered nonlinear sytcm in a suitable "observer canonical form" , in 
which the observer design problem may be easily solved [5].[12j.[I5]. The main 
idea in this case is to find a state transformation to represent the system as 
a linear differential equation plus a nonlinear term, which is a function of the 
measured state. 
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In this section, we present the design of a nonlinear software sensor in which 
the metabolite concentration is the naturally measured state (the most easy to 
measure) and corresponds to the mathematical state X3 in the model introduced 
in the previous section. Therefore, it seems logical to take X3 as the output of 
the system 

y = h(X) = X 3 . (3) 

We now apply the technique of high-gain observers that works for many non- 
linear systems and guarantees that the output feedback controller recovers the 
performance of the state feedback controller when the observer's gain is suffi- 
ciently high. The model given by the aforementioned system Tbioiogy has the 
form 



V = h(X) , W 

in which X € R 3 , and moreover there is a "physical subset" C I 3 where 
the system lies. To make this mathematically precise we must introduce some 
further mathematical terminology. Let us construct the jth time derivative of 
the output. This can be expressed using Lie differentiation of the function h by 
the vector field /, L f j (h) (X(t)). L f j (h) (X(t)) is the jth Lie derivative of h 
by / and a function of X defined inductively as follows 



L f °(h)(X) = h(X) 

(5) 

W{h){X) = ±-{ L r l {h){X))f{X). 
When T y is observable, the map $ : X — > $>(X) is a diffeomorphism where 



/L f °(h)(X)\ ~ X 3 
£ = $(X)= Lf 1 (h) (X) = ^3*2-73*3 

\ Lf 2 (h) (X) J [ K 3 [KiXi - 72 X 2 ) - 73 (K3X2 - 73^3) . 

For &(X) to be a local diffeomorphism in a region 17, it is necessary and 
sufficient that the Jacobian d$(X) should be nonsingular on £1 and moreover 
that &(X) is one-to-one from f2 to $(£7), see [M]. Notice, that no matter if 
we choose H + (X, ??) or H~ (X, ■&), the coordinate transformation is the same. 
This means that the structure of the observer will be the same for both cases: 
gene activation or inhibition. 



When the system is observable on f2, it can be rewritten in the global coor- 
dinate system defined by <&(X) in the following matrix form: 



r:, 
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. is _ 




p(0 



1/ = c c = [1 0] 
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(7) 



where, moreover, </? can be extended from Q to the entire R 3 by a C°° function 
globally Lipschitz on R 3 . The latter form allows us to make use of the following 
result proven by Gauthier and collaborators [6]: 



Consider the system 

r G : £ = F , (i)-s- 1 d T (c£-v) , 

where S(6) is the solution of the matrix equation 



6S - A T S -SA + C T C 







(8) 



(9) 



for 6 large enough, with A a matrix of Brunovsky form ( A = Sij+i; Sij is the Kro- 
necker symbol), which plays the role of a shift operator on R n . Then, Eq. 0} defines 
an observer for T'y, with 



5 - £ < M exp 



--t 



(10) 



In our case, an observer is a dynamical system as given by Eq. |5| that Hill 
track the trajectory of the original system (here T' y ). Notice that both systems 
are identical unless an additional term that compensate the error in the ob- 
server, where the error is given by the difference ||£ — £||, which is seen to be 
exponentially decreasing in time. The Gauthier observer is particularly simple 
since it appears to be only a copy of T' y , together with a correction term that 
depends only on the dimension of the state space and not on the system V 
itself. In others words, the structure of the observer does not depend on the 
Hill steepness parameter m (Eq. (Q])). 

For the sake of concreteness we will construct the observer only for the 

activation case. However, one should notice that only the function / (x) will 

change for the inhibition case. 
The Gauthier observer in Eq. 



X 



in the original coordinates is given by 
T ( X ) $- l C T (h(X) -h(x\\ , (11) 



where 



T [X = 



(12) 



=*(«) 



G 



For the particular three-dimensional state space of Tbioiogy we get 



T (X = 



72 73 

K 2 K 3 

73 

K s 



72+73 

K 2 K 3 
1 

K 3 




if 2 ^ 3 






(13) 



The matrix S(9) in the three dimensional case can be easily computed by 
means of Eq. (J9j) given in the Gauthier theorem and its inverse S~ 1 (9) appears 
to be 



S-\0) = 



3 9 3 9 2 9 3 
3 9 2 5 9 3 2 6> 4 
9 3 2 6» 4 9 5 



(14) 



Plugging the matrices (jT3j) and (jT4j) in Eq. (jlip . we get the following equation 
for the observer introduced by Gauthier and collaborators as applied to our 
biological case 



X 



biology 



/(*) 



O 72 73 A 
° -R"2-R"3 



o (72+73)8 
° K 2 K a n 



2 J38 
K3 



K 2 K 3 



36» 



(x 3 - X 3 ) 



(15) 



We use this form of the Gauthier observer to estimate the states Xi and X 2 
of the dynamical system r D ioiogy We work with 9 = 1 and the values of the 
parameters given in Table 1 that are not necessarily the experimental values but 
are consistent with the requirements of the model. Figure ([T]) shows the results 
of a numerical simulation, where the solid lines represent the true states and 
the dotted lines stand for the estimates, respectively. In addition, for the real 
system we have taken m = 2 whereas for the observer m = 1 in order to show 
the robustness of this type of nonlinear observer with respect to the steepness 
parameter. 



Table 1: Variables and parameters 


Symbol 


Meaning 


Value (arb. units) 


Ki 


Production constant of mRNA 


0.001 


K 2 


Production constant of protein 


1.0 


K 3 


Production constant of metabolite 


1.0 


7i 


Degradation constant of mRNA 


0.1 


72 


Degradation constant of protein 


1.0 


73 


Degradation constant of metabolite 


1.0 


d 


Hill's threshold parameter 


1.0 
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02468 10 02468 10 



time 

Figure 1: The numerical simulation - solid lines represent the true states and 
dotted lines represent the Gauthicr estimates given by Eq. (| 1 5j> for an activation 
case. Plot (a) represents the evolution of mRNA concentration in time and plot 
(b) the variation of protein (enzyme) concentration in time. 

4 Conclusion 

We presented here the mathematical exercise of designing a high-gain observer 
for a simple one-gene regulation dynamic process involving end-product activa- 
tion (inhibition leads to similar results), which is able to rebuild in an effective 
way the non-measured concentrations of mRNA and the involved protein. Thus, 
the limitation of those experiments in which one has available only the metabo- 
lite can be overcome by employing this simple observer. In addition, this type 
of nonlinear observer could be used on line and is robust with respect to m, i.e., 
it does not need the exact value of the Hill steepness parameter. However, for 
more complex inputs of more complicated observable dynamical systems, this 
constant gain observer could have less performance and be overcome by some 
adaptive observers that can change in order to work better or provide more fit 
for a particular purpose. In the case of more limited information, e.g., for un- 
known functional form of the regulation function and high noise levels that can 
spoil the performance of the observer, the completely different mathematical 
procedure of creating dynamical extensions of the observer system are required 
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